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PRODUCTION OF MATUBE PROTEINS 
IN TRANSFORMED YEAST 

BACKGROUND OF THE INVENTION 
1. Field of the Invention* 
5 This invention relates to a method for 

producing a mattire protein in transformed yeast 
and further relates to Saccharomyces cerevisiae 
transformed by a plasmid containing a 
prepropara thyroid hormone cDNA insert • 

10 2, Description of the Prior Art. 

Recombinant DNA technology now makes it 
possible to isolate specific genes or portions 
thereof from higher organisms, such as man and 
other animals, and to transfer the genes or fragments 

15 to a microorganism species, such as E^ col± or 
yeast. The transferred gene is replicated and 
propagated as the transformed microorganism may 
become endowed with the capacity to make whatever 
protein the gene or fragment encodes, whether it 

20 be an enzyme, a hormone, an antigen or an antibody, 
or a portion thereof. The microorganism passes 
on this capability to its progeny, so that in 
effect, the transfer results in a new strain, 
having the described capsdjility. 

25 Recombinant DNA conventionally utilizes 

transfer vectors. A transfer vector is a DNA 
molecule which contains genetic infoinnation 
which insures its own replication when transferred 
to a host microorganism strain. Plasmids are an 

30 exait^le of a transfer vector commonly used in 

genetics. Although plasmids have been used as the 
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transfer vectors for the work described herein, 
it will be understood that other types of transfer 
vectors may be employed, Plasmid is the teirm 
applied to any autonomously replicating DNA unit 



10 



15 



20 



25 



30 



which might be found in a microbial cell, other 
than the genome of the host cell itself* A 
plasmid is not usually genetically linked to the 
chromosome of the host cell, Plasmid DNA exists 
as doublestranded ring structures generally on 
the order of a few million daltons molecular 
weight, although some are greater than 10^ daltons 
in molecular weight. They usually represent only 
a small percent of the total DNA of the cell. 
Transfer vector DNA is usually separable from host 
cell DNA by virtue of the great difference in size 
between them. Transfer vectors carry genetic 
information enabling them to replicate within 
the host cell. 

Plasmid DNA exists as a closed ring. 
However, by appropriate techniques, the ring may 
be opened, a fragment of heterologous DNA inserted, 
and the ring reclosed, forming an enlarged molecule 
containing the inserted DNA segment. 

Transfer is accomplished by a process known 
as transformation. During transformation, host 
cells mixed with plasmid DNA incorporate entire 
plasmid molecules into the cells. Once a 
cell has incorporated a^ plasmid, the latter is 

replicated within the cell and the plasmid replicas < 

are distributed to the progeny cells when the cell 

divides . < 

Genetic information contained in the 
nucleotide sequence of the plasmid DNA, including 
heterologous DNA inserted into the plasmid, can 
in principle be expressed in the host cell. The 
inserted heterologous DNA typically representing 
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a single gene, is expressed when the protein 
product coded by the gene is synthesized by the 
organism. 

Once a gene has been isolated, purified 
and inserted into a plasmid or other vector, the 
5 availability of the gene in siabstantial quantity 
is assured. After transfer of the vector into a • 
suitable microorganism, the gene replicates as 
the microorganism proliferates. The vector 
containing the gene is easily purified from 

10 cultures of the host microorganism by knovm tech- 
niques and separable from the vector by restriction, 
endonuclease cleavage followed by gel electrophoresis. 
The protein product expressed by the heterologous 
gene can also be recovered in substantial quantities 

15 from cultures of the host microorganism by harvesting 
the culture and retrieving the protein product 
contained in the harvested cells. (For futher 
detail of recombinant DNA technology, and an explicit 
exposition of the utility of producing proteins 

20 such as hormones, etc., by recombinant DNA technology, 
see U.S. Patent No. 4,237,224, issued December 2, 1980 
to Cohen et al., and U.S. Patent No. 4,322,499, 
issued March 30, 1982 to Baxter et al. Patents 
and articles cited herein are incoinpoJ^ated by 

25 reference wherever such citations occur and shall 
be' considered incorporated in their entirety as if 

set forth in full) . . 

Recombinant DNA thus holds great promise 
for economically producing substantial quantities 

30 of useful proteins that are difficult or costly to 
isolate in such quantitites from mammalian tissue. 
A major and nearly universal problem in producing 
useful proteins, however, is the construction of 
the actual genetic material to be inserted into the 

35 transfer vector. 
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Conventional means provide for enzymatically 
preparing desired genetic material by reverse 
transcription • Mature messenger RNA (mPNA) , which 
is chemically similar to DNA and rietains most of 
5 the information coded in DNA, can be, extracted 
from tissue in which the desired gene is active. 
mRNA is separated from other RNA material in the 
tissue and coitplementary DNA (cDNA) is produced 
by the enzyme reverse transcriptase, and at times 

10 polymerase 1 for the synthesis of the second strand. 
This cDNA, a complementary copy of mBNA and ' 
similarly containing the information coded in 
RNA, is often further altered in known ways to 
be suitable for insertion into a plasmid vector. 

15 (See W. Mahoney & S. Henikoff, Univ. of Washington 
Medicine , Vol. 8, No. 4, pp. 6-14 (Winter, 1981) >. 

cDNA enzymatically prepared by reverse 
transcription has the potential to express a 
protein chain identical to the protein expressed 

20 by tissue from which the mRNA was extracted. Thiis 

alone is not sufficient, however, for the expression 
of desired mature animal proteins because many 
animal proteins, represented by such diverse 
classes as hormones, binding proteins, enzymes, 

25 antibodies, and collagen, are produced in nature 
in the foirm of larger precursors that are 
subsequently modified by cleavage to smaller 
bioactive forms commonly designated mature 
proteins. Thus, expression of cDNA synthesized 

30 by reverse transcription only has the potential 
to express the precursor of the mature protein 
product. 

It has been known for several years that 
bacteria such as coli can remove the "pre" 
35 portion of its own secreated proteins. Examples 
include the processing of pre-ribose binding 
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protein, pre-galactose binding protein and 
pre-arabinose binding protein, (L. Randall, et al., 
Eur. J. Biochem. , Vol. 92, PP. 411-41S (1978); 
L. Randall, S. Hardy, and L. Josefsson. ProC Natl. 
5 Acad. Sci. USA , Vol. 75, pp. 1209-1212 (1978)). 

S. Chan, et al., Proc. Natl. Acad. S ci. USA, 
Vol. 78, pp. 5401-5405 (1981) has exploited the 
ability of coli to remove the "pre" sequence. 
Chan, et al., modified cDNA for htnnan preproinsulin 

10 to encode a hydrid "pre" sequence containing 

portions of.E. coli and mammalian "pre" sequence. 

coli expressed the hydrid protein and correctly 
removed the "pre" sequence by intra-cellular 
processing. Thus, Chan, et al., was able to modify 

15 human preproinsiilin cDNA in a way that would allow 
E. coli to produce proinsulin. 

It is also known that yeast shares the 
ability to remove "pre" sequences from its own 
pre-proteins . Furthermore, when an coli 

20 preprotein was genetically engineered into yeast, 
pre-B-lactamase was processed to B-lactamase. 
(Roggenkamp, et al., Proc. Nat l. Acad. Sci. 
USA , vol. 78, No. 7, pp. 4466-4470 (1981)). 

The above type of processing of preproteins, 

25 however, will not process to mature proteins 
many of the mammalian hormone precursors and 
many of the other interesting mammalian protein 
precursors in coli . These latter hormone and 
protein precursors contain a "pro" portion which 

30 is not processed by the enzymatic mechanism 

responsible for processing the "pre" portion of 
preproteins. As shown above, for exan?>le, the 
natural precursor for insulin, i.e. preproinsulin 
is processed in Ej_ coli to form proinsulin. 

35 Many investigators have been unabl« to 

express pre-proteins in yeast or coli , let 
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alone get processing. Expensive cind time consuming r 
investigative efforts have focused almost 
exclusively on genetically eliminating the 
"pre" sequences and the "pro" sequences in 
5 atten^ting to express mature proteins without 
intermediates . 

In several prior art approaches, the need 
for processing precursor proteins has been over- 
come. Insulin is the result of natural processing 

10 in human tissue involving cleaving two peptide 

chains, A and B, from the single large precursor 
preproinsulin cuid assembling the A and B chains 
to form the mature hormone insulin. The A and 
B chains are located within proinsulin and hence 

15 coli which processes preproinsulin to proinsulin 

does not produce the matxare hormone insulin. An 
approach to obtaining mature insulin using 
E. coli^ employs chemically synthesized genes 
con5>atible with coli . 

20 . A doToble-stranded synthetic DNA-coding 

sequence for the insulin A chain was synthesized 
chemically from fundamental nucleotide units to 
yield the correct coding sequence. An extra 
amino acid (methionine) was added at one end. 

25 This end was fused to the bacterial gene for the 

enzyme B-galactosidase which results in accumulations 
of f\ised B-galactosidase-insulin-A-chain protein. 
This same procedure wag repeated for the B-chcd.n 
which resulted in the production of fused B-galactosi- 

30 dase-insulin-B-chain protein. 

The fused proteins are insoluble in water ^ 
and readily isolated from broken cells. The A and 
B chains of insulin are released from B-galactosidase 
at the extra methionine by cycuiogen bromide cleavage 

35 and subsequently mixed together under conditions 
that allow formation of disulfide bonds between A 
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and B chains, yielding matxire insulin. (W. Miller, 
J. of Pediatrics , Vol. 99, pp..' 1-15 (1981); 
D. Goeddel, et al., Proc. Natr .' Acad. Sci. USA, Vol 76, 
pp. 106-110 (1979) ) . 

The above prior art approach overcomes the 
need for processing a precursor protein, but in 
turn requires processing of the fused B- 
galactosidase-insulin-A-chain and B-galactosidase- • 
insulin-B-chain proteins to mature insulin. 
Moreover, chemical synthesis of the DNA coding 
sequences for A-chainand B-chain involves 
substantial costs, even when considering that 
the B-galactosidase-insulin-A-chain gene and 
B-galactosidase-insulin-B-chain gene after being 
15 synthesized are easily replicated for subsequent 
production of insulin. (D. Williams, et al.. 
Science , Vol. 215, pp. 687-689 (Feb. 1982); 
W. Mahoney, Univ. of Wash. Medicine , supra) . 

The approach of chemically synthesizing DNA 
20 encoding for mature proteins has also been shown to 
be effective for bacterial production of human 
somatostation. (K. Itakura, et al.. Science, 
vol. 198, pp. 1056-1063 (1977)). However, insulin 
chains A and B and human somatostation are 
25 relatively small sequences and chemically 

synthesized DNA coded for them are relatively 
small. In the case of larger proteins, chemical 
synthesis of the DNA coding sequence coded for 
such proteins is prohibitively time consuming. 
30 One prior art approach, now often followed, 

utilizes chemically synthesized DNA in conjunction 
with enzymatically prepared cDNA to produce a 
gene which instructs production of mature hormone 
in bacteria. Human growth hormone (HGH) is a 
35 protein of 191 amino acids, its precursor having 
an additional 26 amino acid "pre" portion. cDNA 
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encoding the precursor was enzymatically prepared 
from mRNA isolated from human pitxiitary tissue. 
The first useful cleavage site of the cDNA occurs 
at the site encoding amino acid residues 23-24 of 

5 HGH. Treatment of the cDNA with restriction 
endonuclease Hae III gives a DNA fragment of 
551 base pairs which includes coding sequences for 
amino acids 24-191 of HGH. A gene fragment 
having coding sequences for residues 1-23 of HGH 
10 (and an initiation codon) was chemically synthesized. 
The two DNA fragments were combined to form a 
synthetic-natural hybrid gene which when inserted 
into a plasmid vector directed expression of 
mature HGH in £. coli. (D. Goeddel, et al., 

15 Nature , Vol. 281, pp. 544-548 {October 1979)). 

Using a similar strategy of cleavage and 
reconstriction of DNA for the mature protein ^ 
R. Lawn et al . , Nucleic Acids Research , Vol . 9 , 
No. 22, pp. 6103-6114 (1981), expressed mature 

20 human albumin in E^^ coli. 

This general approach, however, requires 
time consTuoing chemical synthesis of desired gene 
fragments, cleavage of cDNA assuming the availa- 
bility of useful cleavage sites and difficult 

25 genetic construction of plasmids from DNA fragments. 
Furthermore, in both of the above exaixples, an 
initiator methionine was left at the NHj -terminal. 
The initiator methionines caimot practically be 
removed since HGH and albumin also have methionines 

30 located elsewhere in the sequence. Thus, removing 
the initiator methionine by cayanogen bromide 
cleavage, would res\ilt in cleavage at the other 
methionines. This would resoilt in a protein split 
into cleaved fragments. Both the HGH and albumin 

35 produced by the above approach are "matxire" 

proteins - which start with methionines . Hence they 
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are not "real" mature proteins. 

The prior art approaches set forth above 
illustrate that a major and nearly iiniversal 
problem in producing mature proteins is the con'- 
5 struction of the actual genetic material to be 
inserted into transfer vectors. Procedures 
exist for preparing cDNA from mRNA isolated from 
mammalian or other higher order animal tissue, but 
mammalian and higher order animal proteins are 

10 most often expressed as precursors and subsequently • 
processed into the mature protein in cells of 
origin. The prior art has identified coli 
and yeast as microorganisms capable of processing 
precursors containing the "pre" portion, but this 

15 class of precursors excludes many of precursors 

of interest. The prior art thus has not identified 
a microorganism suitable for cloning mammalian and 
higher order animal genes which is capable of 
processing to mature proteins precursors of 

20 greatest interest. The prior art approaches atteinpt 
to solve the problem by constructing genes that 
code for mature protein. However, although 
procedures now exist for identifying nucleotide 
coding sequences for matxire proteins / chemical 

25 synthesis of DNA sequences encoding mature 

.proteins or fragments thereof for use in hydrid 
genes is costly and time consuming, often 
prohibitively so. ^ 

BRIEF DESCRIPTION OF THE FIGURES 

30 FIG. 1 shows inferred protein cleavage 

sites within the precursor of yeast a- factor, where 
"K" designates lysine and "R" designates arginine 
amino acid residues. 

FIG. 2 shows the cDNA sequence encoding 

35 preproparathyrcid liormone and the unique Pvu II 
and Hinf 1 cleavage sites . 
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FIG. 3 shows certain portions of the 
nucleotide sequence of the pYEM-l plasmid. 

SUMMARY or TEE INVENTION . 
In the present Invention r a method is 
5 disclosed for producing protein in yeast transformed 
to express a corresponding precursor containing 
a pair or triplet of basic amino acid residues 
located proximally and/or dis tally adjacent to the 
protein portion of the precursor sequence conqprising 

10 proteolytic processing by the yeast of the 

precursor at the site of such pairs or triplets 
of basic amino acid residues* The method comprises 
proteolytic processing by transformed yeast which 
contains cin endopeptidase, designated herein as a 

15 trypsin-like enzyme or enzymes. The tarypsin-like 
enzyme or enzymes proteolytically process the 
precursor at the site of such pairs or triplets 
of basic amino acid residues by cleaving at the 
distal side of such pairs or triplets. The 

20 method further comprises proteolytic processing 

by transformed yeast that contains an exopeptidase, 
designated herein a carboxypeptidase-B-like 
enzyme or enzymes. The carboxypeptidase-B-like 
enzyme or enzymes proteolytically process the 
.25 precursor at the site of such pairs or triplets 
of basic amino acid residues by degrading- such 
pairs or triplets of basic amino acid residues 
remaining dis tally adjacent to the protein portion 
of the precursor sequence after the cleavage by 

30 the trypsin-like enzyme or enzymes. 

In the present invention, the above method 
is further disclosed for proteolytic processing of 
proto-proteins to mature proteins. Proto-proteins , 
defined with greater specificity inf^/ consist 

35 generally of precursor proteins in which the 
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protein portion of the precursor s^uence is 
identical in structure ,to .the mature protein except 
for the abscence of the amino teinninal and the 
carboxyl terminal in the precursor sequence.' .TJie 
5 above method is also disclosed for proteolytic 
processing of certain jion-proto-proteins • For 
example, the above method .is disclosed for 
proteolytic processing of preproinsulin or 
proinsulin to mature ins-ulin. The above method is 

10 disclosed for producing mammalian insulin 

generally as well as hiiman, bovine, and porcine 
insulin specifically. - According to the method, . 
preprocalcitonin and procalcitonin may be 
proteolytically processed by transformed yeast 

15 to form mature calcitonin or a calcitonin relative 
in the case of animal calcitonin generally and 
htiman, bovine, and porcine calcitonin specifically. 

In the present invention, a recombinant 
DNA plasmid transfer vector useful for transforming 

20 yeast comprising a DNA sequence comprising. the 

preproparathyroid gene cDNA sequence is disclosed 
as well as the plasmid pYEM-1 and yeast transformed 
by a plasmid comprising the above transfer vector 
and yeast transformed by the plasmid pYEM-1. 

25 DESCRIPTION OF THE SPECIFIC EMBODIMENT 

Proto-proteins may consist of precursors 
for which DNA and mRNA encoding the precursors 
naturally occur in animals. This type of 
proto-protein is designated source natural 

30 proto-proteins. Proto-proteins may also consist 
of precursors in which synthetic DNA encodes the 
precursor. This type of proto-protein is designated 
source synthetic proto-protein. For example, by 
chemical synthesis, or alternatively by enzymatic 

35 cleavage, rearrcingement and subsequent fusion, DNA 
can be synthesized so that the precursor which 



WO8401 1 73 .[file:/A\dcwas03\fifrndaM^^ 1 73.CPC3 

WO 84/01173 



Page 1 4 of 2 9 

PCTAIS83/P1361 



-12- 

it encodes has the cleavage properties discussed 
below. Production of mature protein might be 
enhanced by transforming yeast with synthetic DNA 
encoded for a precursor having repetitive sequences 
5 of the mature protein, each sequence being flanked 
by appropriate cleavage sites. 

Source natural proto-proteins are illustrated 
by, but not limited to, certain hormone precursors, 
including preproparathyroid (J. Habener & J. Potts, 

10 The New England Journal of Medicine (Second Part) , 
Vol. 299, No. 12, pp. 635-643 (Sept. 1978)), 
preprosomatostatin (P. Hobart, et al.. Nature , 
Vol. 288, pp. 137-139 (November 1980)), AVP-NpII 
precursor to arginine vasopressin and its 

15 corresponding neurophysin (H. Land, et al.. Nature, 
Vol. 295, pp. 299-303 (January 1982)), cortitropin 
B-lipotropin precursor to corticotopin (ACTH) 
and B-lipotropin (B-LPH) (S. Nakanishi, et al.. 
Nature, Vol. 278, pp. 423-427 (March 1979)), 

20 preproglucagon (P. Lund, et al.. Pro. Natl. Acad. 
Sci. USA , Vol. 79, pp. 345-349 (January 1982)), 
and pro-opiomelanocortin (POMC) precursor to 
B-endorphin and Met- and Leu-enkephalin precursor 
(M. Comb, et al.. Nature, Vol. 295, pp. 663-6€6, 

25 (February 1982)). 

Source natural proto-proteins are also 
illustrated by melittin precursor (G. Suchanek, 
et al., Eur. J. Biochemistry , Vol. 60, pp. 309-315 
(1975); G. Suchanek, et al., Proc. Natl. Acad. 

30 Sci. USA , Vol. 75, pp. 701-704 (1978)) and serum 
albumin precursors (R. Lawn, et al.. Nucleic Acids 
Research , Vol 9, No. 22, pp. 6103-6114 (1981)). 

As reported in the above citations, these 
precursors contain within their sequence at least 

35 one mature protein sequence. Where there is a 
single mature protein sequence contained in the 
precursor it is flanked proximally by a pair or 
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triplet of basic amino acid residues consisting* 
of lysine and/or arginine and is flanked distally 
by either the carboxyl -terminal of the precursor 
or a pair or triplet of basic amino acid residues 
5 lysine and/or arginine. If there are several mature 
protein sequences contained in the precursor/ at 
least one. of the mature protein sequences is 
flanked proximally by a pair or triplet of such 
basic amino acid residues and is flanked distally 

10 by either the carboxyl -terminal of the precursor 
or a pair or triplet of such basic amino acid 
residues. Any precursor protein falling within 
this description is defined herein as a proto-protein, 
whether it be source natural or source synthetic. 

15 As reported in the above citations in 

connection with observing the production of mature 
proteins in mammals and other higher order animals, 
the cleavage site located on the distal side of 
a pair or triplet of such basic amino acid residues 

20 is readily attacked by endopeptidases with 
trypsin-like activity. After endopeptidase 
cleavage, any residual basic residues remaining 
adjacent to and on the distal side of the mature 
protein are susceptible to degrading, i.e. 

25 selective removal, by exopeptidases with activity 
resembling that of carboxypeptidase-B . 

Thus, for example, in prepropara thyroid 
hormone the mature protein is flanked proximally 
by the basic triplet lysine-lysine-arginine and is 

30 flanked distally by the carboxyl-terminal of the 
precursor. A single cleavage by a trypsin-like 
enzyme is sufficient ±0 produce the mature hormone. 
In other proteins such as the glucagon precursor, 
two mature glucagon proteins are flanked both 

35 proximally and distally by a basic pair lysine- 
arginine. Combined cleavage by a trypsin-like 



CMfl 



WO8401 1 73.[file:/A\ dcw a s03 \firmdata\lp \Fo leyPat\ Pa te ntD ocumen ts\W0 8 401 1 7 3.C PC1 



"WO 84/01173 



PCTAJS83/P1361 



Page 16 of 29 



-14- 

enzyme and degradation of the resulting carboxyl- 
tenninal by a carboxypeptidase-B-like enzyme are 
required to produce the mature proteins . 

The method of the present invention comprises •, 
5 preteolytic processing by yeast of proto-proteins 
to mature proteins. In the method, transformed ' 
yeast naturally containing a trypsin-like enzyme 
or enzymes and a carboxypeptidase-B-like enzyme 
or enzymes , proteolytically release mature proteins 

10 from larger precursors. These enzymes will effectively 
cleave and degrade proto-proteins to mature proteins. 
This is confirmed by a trypsin-like cleavage, 
discussed infra, of prepropara thyroid hormone 
yielding mature parathyroid hormone. This is 

15 further confirmed by yeast processing its own 
mating factor, a-f actor. (T. Tanaka, et al., 
J. Biochemistry , Vol. 82, pp. 1681-W87 (1977)). 
As shown in FIG. 1, the nucleotide sequence of 
a-factor shows that yeast naturally expresses a 

20 precursor containing four distinct codings for 
mature a-factor. Three of the four a-factors 
in the precursor are flanked distally by a pair 
of basic amino acids residues. A trypsin-like 
cleavage in combination with a carboxypeptidase- 

25 B-like degrading naturally yields coirrectly 

processed C-termini for these three . a-f actors . 
After a trypsin-like cleavage, N- termini of the 
four a-factors are flanked proximally by a series 
of several glutamic acid and alanine amino arid 

30 residues. These latter residues are in turn removed 
by an aminopeptidase. The foregoing natural 
endopeptidase and exopeptidase activity in yeast 
in combination with the virtual uniform presence of 
pairs and triplets of lysine and/or arginine 

35 flanking mature hormone sequences in proto-proteins 

underlies the present invention. 
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Although pr.eproinsulin and proinsulin cohtaining 
disulfide bonds are not proto-proteins as defined 
herein they will nevertheless undergo proteolytic 
processing in yeast transformed to express the 
5 preproinsuiin or proinsulin. A pair or triplet of 
basic amino acid residues are located distally 
and/or proximally adjacent to the insulin-A- 
chain and the insulin-B-chain portions of the 
sequence which constitute the protein portion of 

10 the precursor preproinsuiin and proinsulin. 

sequence. The requisite disulfide bonds between 
the insulin-A-chain portion of the sequence and 
the insulin-B-chain portion of the sequence will 
be formed in yeast, (cf. the numerous examples 

15 of disulfide bond formation in yeast disclosed in 

M. Dayhoff, Atlas of Protein Sequence and Structure, 
Vol. 5 and Supplements 1, 2 & 3 (National Biomedical 
Research Foundation, Georgetown University Medical 
Center, Washington, D.C, 20007 (1972, 1973, 1976, 

20 and 1981) ) ) . Proteolytic processing at the site 
of such pairs or triplets of basic amino acid 
residues, twill yield mature insulin from preproinsuiin 
or proinsulin containing the disulfide bonds. 

In the absence of disulfide bond formation 

25 between the insulin-A-chain portion of the sequence 
and the insulin-B-chain portion of the sequence, 
proteolytic processing will yield insulin-A-chain 
and insulin-B-chain, which may be caused in turn 
to attach to one another by disulfide bonds by 

30 conventional means to form mature insulin. In 

this case, the insulin-A-chain and insulin-B-chain 
may be considered mature proteins and preproinsuiin 
and proinsulin without disulfide bonds may be 
considered a proto-protein according to the above 

35 discussion of proto-proteins. 
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Mature calcitonin contains disulfide bonds 
between the cysteines located at positions 1 and 
7 of the sequence, contains a carbohydrate attached 



10 



15 



20 



25 



30 



at the sequence at position 3, and the proline 
at position 32 has been amidated to pro-amide 
while the glycine at position 33 has been removed, 
PreprocaLc itonin and procalcitonin will contain 
the requisite disulfide bonds, (of, the numero^lS 
examples of disulfide bond formation in yeast as 
disclosed in Dayhoff , supra ) • A carbohydrate 
will be attached at position 3 in calcitonin. 
Preprocalcitonin and procalcitonin will undergo 
proteolytic processing in yeast transformed to express 
the preprocalcitonin or procalcitonin, A pair 
of basic amino residues are located proximally 
adjacent to the 33 amino acid sequence, while a 
triplet is located distally adjacent to the 33 
amino acid sequence. It is expected that 
amidation of the proline located at 32 will 
occur in yeast after the cleavage distall to and 
degradation of the triplet, (cf . numerous exairples 
of amidation in yeast as disclosed by Dayhoff , 
supra ) . In the event that a carbohydrate 
differing from the carbohydrate .of mature calcitonin 
is formed by the yeast, the calcitonin relative 
containing the differing carbohydrate may be 
converted to mature calcitonin by conventional 
means . In the event that eunidation following 
cleavage and degradation is suppressed, the 
calcitonin relative lacking the amidation may also 
be converted to mature calcitonin by conventional 
means. 

By reverse transcription, cDNA can be 
prepared encoding any proto-protein of interest by 
isolating mHNA from tissues expressing the protein. 
Although many hoannone cind other protein genes have 
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already been cloned in coli, yeast has heretofore 
not been the host of choice. cDNA not previously 
cloned in yeast can be rendered compatible with a 
yeast host by proper codon selection (J . Bennetzen & 
B. Hall, J. Bio Chem. / Vol. 257, pp. 3026 (1982)) 
and by site specific mutagenesis of the cDNA 
(G. Simmons, et ai - . Nucleic Acid Research, 
Vol. 10, ppi 821 (1982)). 

Thus, one of the fundamental problems with 
producing useful mature proteins by recombinant 
DNA techniques has been siii?>li^ied in the case of 
mature proteins derived f rom proto-proteins . 
CDNA, although readily available for most proteins 
by reverse transcription of mRNA isolated from 
15 animal tissue, will express the precursor of the 
mature protein. Yeast, but not cold?, has the 
requisite enzymes to process expressed proto-proteins, 
preproinsulin, or proinsulin to mature protein or 
insulin . 

20 EXPERIMENTAL 

In order to demonstrate the present invention, 
the following experiment was carried out. 

The plasmid YEp-13 was obtained from 
Dr. Steven Henekoff , Fred Hutchinston, Dept. of 
25 Developmental Biology, Seattle, Washington, and can 
be constructed according to J. Broach, et al . , 
Gene , Vol. 8, pp. 121-133, (1979). The gene which 
encodes yeast alcohol dehydrogenase 1 was modified 
according to Hitzelman,' et al.. Nature (London), 
Vol. 293, pp. 717-722 (1981) , allowing the 
isolation of the transcription signals. These 
sequences, including the cloning site, were 
provided by Dr. G. Amroera. The plasmid YEp-13 
was modified so that the tet^ gene of YEp-13 was 
interrupted at the Bam HI site with the yeast alcohol 
dehydrogenase 1 gene promoter and RNA polymerase 



30 



35 
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stop sequences . A Hind III site between the 
latter two elements provided the cloning site. 
These modifications of plasmid YEp-13 were 
accomplished by methods set forth generally in 
5 U.S. Patent 4,237,224,' supra , and Methods of 

• Enzymology , Vols. 65 and 68 wherein such methods* 
are reviewed. 

The cDNA sequence coding bovine prepropara- 
thyroid hormone, shown in PIG. 2 and further 

10 described in B. Kemper, et al Hormonal Control 
of Calcium Metabolism (Ed. by D. Cohn, et al., 
published Excerpta Medica at Amsterdam, Oxford, 
and Princeton 1981) at pp. 19, was obtained from 
Dr. Byron Kemper, Department of Physiology and 

15 Biophysics cuid School of Basic Medical Sciences, 

University of Illinois-Urbana. This cDNA sequence 
was restricted with the enzymes PVU II and Hinf 1 
at the sites shown in FIG. 2. These enzymes were 
obtained from New England Biolaboratories , Beverly 

20 MA. The Hinf 1 site shown in PIG. 2 was filled with 
nucleotides using the enzyme DNA polymerase 1 
(the large fragment) which was obtained from 
New England Nuclear, Boston, MA. This modified 
sequence was then blunt-end ligated to Hind III linkers 

25 and restricted with the enzyme Hind III. The 

Hind III linkers and Hind III enzyme were obtained 
from New England Biolaboratories, supra , The 
resulting DNA fragment was then ligated into -the 
Hind III site of the modified plasmid YEp-lS 

30 forming a novel plasmid. This plasmid was designated 
pYEM-1. FIG. 3 shows certain portions of the 
nucleotide sequence of pYEM-1. The foregoing 
construction of pYEM-1 was accomplished by methods 
set forth generally in U.S. Patent No. 4,237,224, 

35 supra , the BLR M13 handbook, and Methods of 

Enzymology , Vols. 65 and 68 wherein such methods are 
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reviewed* 

After constructing pYEM-1, yeast cells 
were transformed with the plasmid using the methods 
of Beggs, Nature (London) , Vol. 27,5, pp- 104-109 
(19.78) and Hinnen, et' al>/ Proc. Natl . Acad/ of 
Sci. USA , Vol. 75, pp.; 1929-1933 (1978). Because 
pYEM-1 has the yeast leu 2 gene, the use of a 
leu 2 negative strain of yeast was used in the 
transformation for the purposes of selecting 
successful trans formants. Yeast strain, X106.9-2D, ' 
a strain of Saccharomyc'es; cerevisiae defective 
in leu-2 function, was obtained from the Yeast 
Genetic Stock Center,. Univ. of Calif orinia- 
Berekeley. 

Of course any other defective yeast strain, 
including strains within' Sa<:charomyces poimbe and 
other species, could be used. All that is required 
is that a complementation system be established 
between the yeast strain and the cloning/expression 
vector and that the vector be stabily maintained . 
in yeast. For exairple, a Trp 1 strain could be 
used if the Trp 1 gene was on the vector. To 
date, several stable transformation systems have 
been described. (A. Hinnen and B. Meyhack, 
Current Topics in Microbiology and Immunology , 
Vol. 96, pp. 101-117 (1981); C. Hollenberg,' 
Current Topics in Microbiology and Immunology , 
Vol. 96, pp. 119-144 (f981)). 

The transformed yeast cells containing 
plasmid pYEM-1 were grown in a leucine deficient 
media containing 5% glucose, yeast extract, 
yeast nitrogen base and other nutrients suitable 
for yeast strain X1069-2D. After 24 hours of 
growth at 30 "'C, the media was collected and the 
yeast cells lysed. Bioassay was performed 
according to conventional ^chniques and PTE 
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radioiitununoassay was performed using Iiranuno 
Nuclear Corporation (Stillwater, MN) assays 
specific to the N-terminal, mid-molecule, and 
C-terminal regions of parathyroid hormone. The 
following table shows that both immunologically 
cross-reactive parathyroid hormone and biologically 
active parathyroid hormone is being produced in 
yeast. 



10 



TABLE 



15 



PTH PTE PTH 
N- Mid- C- 
terminal molecule terminal Bioassay* 
RIA* region RIA* RIA* ' 



Cell lysate 
pYEM-1 
control 

Media 

pYEM-1 
control 



16 
0 



2 
0 



16 
0 



2 
0 



16 
0 



2 
0 



20 



*expressed in nanomoles/ml 



10 
0 



0.015 
0 



To confirm that correct processing had 
occurred, 50 ml of culture was prepared in which 
the parathyroid hormone producing yeast were 

25 grown in media containing ^^S methionine 

(80 u ci/ml) . After an overnight growth the 
cells were removed by centrifugation. The media 
was then incxabated with specific N-terminal 
parathyroid hormone antibody. After two hours 

30 the antibody-antigen complex was recovered by 

centrifugation and washed three times with new 

media followed by an ether wash. This complex 

35 

contained about 7,000 cpm of S methionine 
incorporated into protein after TCA precipitation, 
35 This mixtTire was applied to a Beckman 890D 



WOa401173 ?f ile: /A\dcw as03\firm data\lp\FoleYPat\PatentDocuments\WOa401 17 3 .CPC3 



WO 84/01173 



PCT/US83/01361 



Pa ge 23 of 29 



-21- 



10 



15 



20 



sequencer according to the methods " of Mahoney • 
and Nute, Biochem . Vol. 19, pp. 4436 (1980V and 
subsequently degraded 40 cycles. Sequiance analysis 
denionstrated that the "^^S methionine was all 
contained in cycles number 8 and 18. In mature 
PTH, methionine appears only at positions 
8 and 18 in the sequence. If preproparathyroid 
hormone expressed by the yeast was left unprocessed, 
we would expect S methionine in cycles 1, 2, 
7, 11, 14, 49, and 59 reflecting the appearance 
of methionine at positions -31, -30, -25, -21, 
-18, +8, +18 in the preproparathyroid sequence. 

The novel microorganism . yeast strain 
Xip69-2D transformed by novel plasmid pYEM-1, 
designated Xl069-2D-pYEM-l, was placed on permanent 
deposit in the Northern Regional Research Center, 
U.S. Dept. of AgricultTire, Peoria, Illinois 
61604 on September 8v 1982. The NRRL number for 
X1069-2D-PYEM-1 is ¥-15153. The plasmid pYEM-l 
and the transfer vector contained therein may be 
removed from this novel yeast strain by known 
means . 

While the invention has been described in 
connection with a specific embodiment thereof, it 
will be understood that it is capable of further 
modifications and this application is intended to 
cover any variations uses, or adaptations of the 
invention within the scope of the appended claims . 




WOB401 173 Jfi ie:/A\dcwas03\firmda ta\lp\ Fo ley Pa t \Pa te ntD oc uments\W084 011 73 - CP C] ^ Pag 

WO 84/01173 PCTAJS83/01361 

-22- 
CLATMS 

1. A method for producing protein in yeast 
transformed to express a corresponding precvirsor 
containing a pair or triplet of basic amino acid 
residues located proximally and/or dis tally adjacent 
to the protein portion of the precursor sequence, 
comprising proteolytic processing by the yeast 
of the precursor at the site of such pairs or 
triplets of basic amino acid residues ♦ 

2. The method of claim 1 wherein the proteolytic 
processing by yeast of the precursor at the site 
of such pairs or triplets of basic amino acid 
residues comprises cleaving, by a trypsin-like 
enzyme or enzymes present in the transformed yeast, 
at the distal side of such pairs or triplets of 
basic amino acid residues. 

3. The method of claim 2 wherein the proteolytic 
processing by yeast of the precursor at the site 
of such pairs or triplets of basic amino acid 
residues further comprises degrading, by a carboxy- 
peptidase-B-like enzyme or enzymes present in the 
transformed yeast, of any such pairs or triplets 
of basic amino assay resid\:es remaining distally 
adjacent to the protein portion of the precursor 
sequence- after the cleavage by the tarypsin-like 
enzyme or enzymes. 

4. The method of claim 1 wherein the corresponding 
precxirsor is a proto-protein. 

5. The method of claim 4 wherein the proto- 
protein is source synthetic proto-protein. 
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6. The method of claim 4 wherein the proto- 
protein is source natural proto-protein • 

7. The method of claim 6 wherein the source 
natural proto-protein is bovine prepropara thyroid 
hormone. 

8. The method of claim 1 wherein the protein 

is mammalian insulin and the corresponding precursor 
is mammalian preproinsulin or proinsulin. 

9. The method of claim 8 wherein the 
mammalian insulin and mammalian preproinsulin 
or mammalian proinsulin are members respectively 
of the group consisting of human insulin and 
hioman preproinsulin or human proinsulin, 
bovine insulin and bovine preproinsulin or 
bovine proinsulin, and porcine insulin and porcine 
preproinsulin or porcine proinsulin. 

10. The method of claim 1 wherein the protein 
is animal calcitonin or an animal calcitonin 
relative and the precursor is animal preprocalcitonin 
or animal procalcitonin. 

11. The method of claim 10 wherein the animal 
calcitonin or animal calcitonin relative and the 
animal preprocalcitonin or animal procalcitonin 
are members respectively of the group consisting 
of human calcitonin or hioman calcitonin relative 
and hxaman preprocalcitonin or human procalcitonin, 
bovine calcitonin or bovine calcitonin relative 

and bovine preprocalcitonin or bovine procalcitonin, 
and porcine calcitonin or porcine calcitonin 
relative and porcine preprocalcitonin or pro- 
calcitonin. 
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12. The method of claim' 1 wherein the yeast 
is Saccharomyces cerevisiae or Saccharomyces 
pombe. 

13. The method of claim 12 wherein the yeast 
is Saccharomyces cerevisiae. 

14. A recombinant DNA plasmid transfer vector 
useful for transforming yeast comprising a 

DNA sequence comprising the preproparathyroid 
genecDNA sequence. 

15. The plasmid pYEM-1 

16. Yeast transformed by a plasmid comprising 
the transfer vector of claim 14. 

17. Yeast transfoanaed by the plasmid of 
claim 15. 
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.^-CTblack borders 

□ image cut off at top, bottom or sides 

□ faded text or drawing 
i^blurred or illegible text or drawing 

□ skewed/slanted images 

□ color or black and white photographs 

□ gray scale documents 

^^QTlINES or marks on ORIGINAL DOCUMENT 

□ REFERENCE(S) or EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: ■ 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



